Semantic Dictionary Encoding
falvotech.com·2h·
Discuss: Hacker News
🗂️Type Indexing
Learn How to Use Transformers with HuggingFace and SpaCy
towardsdatascience.com·3h
📊Pratt Parsers
Topological Sort: Managing Mutable Structures in Haskell
mmhaskell.com·8h
🪢Rope Data Structures
Show HN: Semlib – Semantic Data Processing
github.com·3h·
Discuss: Hacker News
🔍ML Language
RustGPT: A pure-Rust transformer LLM built from scratch
dev.to·4h·
Discuss: DEV
🏗️Cranelift
Announcing datalit: A macro to generate fluent, readable static binary data
reddit.com·20h·
Discuss: r/rust
🦀Rust Macros
HANRAG: Heuristic Accurate Noise-resistant Retrieval-Augmented Generation for Multi-hop Question Answering
arxiv.org·13h
🎲Parser Fuzzing
Which NPM package has the largest version number?
adamhl.dev·14h·
Discuss: Hacker News
📦Monorepos
I didn’t know these Excel functions existed, but now I can’t live without them
makeuseof.com·3h
📝Editor Buffers
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·13h·
🌱Minimal ML
A (Nearly) Branchless RESP Request Parser
kevinmontrose.com·5h
🔧Error Recovery
ECMAScript TC39 proposal-pattern-matching
github.com·3h·
Discuss: Hacker News
🎯Pattern Matching
[1] Algorithm Showdown: Python vs. JavaScript - Group Anagrams
dev.to·20h·
Discuss: DEV
🔗Hash Functions
Challenges You Will Face When Parsing PDFs with Python
theseattledataguy.com·2h·
Discuss: Hacker News
🚀Tokenizer Performance
LLM Rerankers for RAG: A Practical Guide
fin.ai·19h·
🪜Recursive Descent
I Vibe Coded an R Package
jcarroll.com.au·2d·
💬Interactive REPLs
OTW - Bandit Level 4 to Level 5
tbhaxor.com·11h
📦Executable Size
Ancient Scripts, Modern AI: Bridging the Divide with Morphology-Aware Tokenization by Arvind Sundararajan
dev.to·1d·
Discuss: DEV
Tokenizer Benchmarks
Weighted random generation in Python (2010)
eli.thegreenplace.net·19h·
Discuss: Hacker News
⏭️Skip Lists